Complete Genome Sequence of Clostridium sp. Strain DL-VIII, a
Novel Solventogenic Clostridium Species Isolated from Anaerobic 
Sludge 

Safiyh Taghavi, a,b Javier A. Izquierdo, a Daniel van der Lelie a,b

Center for Agricultural and Environmental Biosolutions, RTI International, Research Triangle Park, North Carolina, USA a ; Biosciences Department, Brookhaven National
Laboratory, Upton, New York, USA b 

We report the genome sequence of Clostridium sp. strain DL-VIII, a novel Gram-positive, endospore-forming, solventogenic 
bacterium isolated from activated anaerobic sludge of a wastewater treatment plant. Aside from a complete sol operon, the 
6,477,357-bp genome of DL-VIII reveals genes for several unique enzymes with applications in lignocellulose degradation, in- 
cluding two phenolic acid decarboxylases. 
Growing demand for alternative liquid fuels has sparked renewed
interest in butanol-producing Clostridia such as Clostridium
acetobutylicum and Clostridium beijerinckii, but such
model organisms that perform acetone-butanol-ethanol (ABE) 
fermentations are limited in number (1). Here we describe the 
genome sequencing of a novel solventogenic Clostridium spe- 
cies isolated from anaerobic sludge, Clostridium sp. strain 
DL-VIII. 

Clostridium sp. DL-VIII genomic DNA was sequenced using 
a combination of Illumina (2) and 454 technologies (3). The 
initial draft assembly contained 89 contigs in one scaffold. The 
454 Titanium and 454 paired-end data were assembled with 
Newbler version 2.3. Newbler consensus sequences were com- 
putationally shredded into 2-kb overlapping shreds. Illumina 
sequencing data were assembled with Velvet, version 1.0.13 
(4), and the consensus sequence was computationally shredded 
into 1.5-kb overlapping shreds. All shreds were integrated with 
the 454 library read pairs using parallel Phrap, version SPS 4.24 
(High Performance Software, LLC). Consed (5-7) was used in 
the finishing process. Illumina data were used to correct poten- 
tial base errors and increase consensus quality using Polisher 
(Alla Lapidus, unpublished). Possible misassemblies were corrected
using gapResolution (C. Han, unpublished), Dupfinisher (8), or
subcloning. Gaps between contigs were closed in Consed, by PCR, and
by Bubble PCR primer walks. The
final assembly is based on 245.2 Mb of 454 draft data, providing 
an average 38.6× coverage, and 685.3 Mb of Illumina draft
data, providing an average 107.8× coverage of the genome.
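The computational shredding step described above can be illustrated with a minimal sketch. It cuts a consensus sequence into fixed-length, overlapping pseudo-reads of the kind that were passed on to Phrap. The function name `shred` and the overlap value are assumptions made for illustration only: the text gives the shred lengths (2 kb for Newbler consensus, 1.5 kb for Velvet consensus) but not the overlap or the tool actually used.

```python
def shred(consensus: str, shred_len: int, overlap: int) -> list[str]:
    """Cut a consensus sequence into fixed-length, overlapping shreds.

    shred_len and overlap are in bases. The overlap is a hypothetical
    parameter; the source does not state the value that was used.
    """
    if not 0 <= overlap < shred_len:
        raise ValueError("overlap must be non-negative and smaller than shred_len")
    step = shred_len - overlap
    shreds = []
    for start in range(0, len(consensus), step):
        shreds.append(consensus[start:start + shred_len])
        if start + shred_len >= len(consensus):
            break  # this shred already reaches the end of the consensus
    return shreds


# Toy example: 4-base shreds overlapping by 2 bases.
print(shred("ACGTACGTAC", shred_len=4, overlap=2))
# ['ACGT', 'GTAC', 'ACGT', 'GTAC']
```

For a Newbler-sized shred one would call, for example, `shred(contig, shred_len=2000, overlap=...)` with an overlap of one's choosing. The last shred may be shorter than `shred_len` when the consensus length is not an exact fit.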

The complete genome is composed of a circular chromosome 
of 6,477,357 bp (30.1% GC content), including 6,230 coding 
genes, 79 tRNAs, and 13 rRNA operons. A total of 5,138 coding 
genes (82.5% of the total) have putative functions assigned on the 
basis of annotation. BLAST results of the 16S rRNA gene operons
point to C. beijerinckii NCIMB 8052 as the closest relative, with
98.7% to 99.5% sequence identity. However, there are 1,483 genes
in DL-VIII (23.8% of the total) not present in C. beijerinckii, while
the remaining 4,747 genes shared between C. beijerinckii and DL-VIII
have an average amino acid sequence similarity of 78.42%.
Therefore, Clostridium sp. DL-VIII represents a novel solvento- 
genic species. 
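As a quick consistency check on the gene counts reported above, the following sketch recomputes the stated percentages from the raw numbers. The constant and function names are illustrative only; the counts themselves are taken directly from the text.

```python
# Gene counts as reported in the text.
TOTAL_CODING_GENES = 6230
ANNOTATED_GENES = 5138         # genes with putative functions assigned
UNIQUE_TO_DL_VIII = 1483       # genes not present in C. beijerinckii
SHARED_WITH_BEIJERINCKII = 4747


def percent(part: int, whole: int) -> float:
    """Return part as a percentage of whole, rounded to one decimal place."""
    return round(100 * part / whole, 1)


# The unique and shared gene sets should partition the coding genes.
assert UNIQUE_TO_DL_VIII + SHARED_WITH_BEIJERINCKII == TOTAL_CODING_GENES

print(percent(ANNOTATED_GENES, TOTAL_CODING_GENES))    # 82.5
print(percent(UNIQUE_TO_DL_VIII, TOTAL_CODING_GENES))  # 23.8
```

The unique and shared gene counts sum to the total number of coding genes, and both percentages match the values given in the text.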

Among the unique genes found in strain DL-VIII are the 
first two phenolic acid decarboxylases detected in a Clostridium 
species (CDLVIII_0111 and CDLVIII_2616), which may play a
role in lignin depolymerization. Likewise, 19 glycosyl hydro- 
lases from 9 families were not found in C. beijerinckii, with 
particular prominence of hemicellulose-specific enzymes from 
families GH10, GH39, and GH43. An analysis of the sol operon 
in this strain reveals distinct sequence differences from closely
related solventogenic Clostridia, such as C. beijerinckii,
C. saccharobutylicum, and C. saccharoperbutylacetonicum, in key
steps catalyzed by butyraldehyde dehydrogenase (83.8 to 
85.5% sequence identity), 3-oxoacid CoA-transferase (79.6 to 
85.9% identity), and acetoacetate decarboxylase (91.4 to 92.6%
identity). 

The genome sequence of Clostridium sp. DL-VIII will contrib- 
ute to our knowledge not only of the limited diversity of solven- 
togenic Clostridia but also of other processes relevant to biofuels 
production, such as lignin depolymerization. 

Nucleotide sequence accession number. The complete ge- 
nome sequence of Clostridium sp. DL-VIII is deposited in 
GenBank under accession number CM001240. 